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The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 




Source: /^T Q ? 

Date Processed by STIC: S~ ~ 7 ~ O I 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 c-mail help: patin21help@usnto.eov or phone 703-306-41 19 (R. Wax) 
PATENTIN 3.0 c-mail help: patin3helpfaiuspto.gov or phone 703-306-41 19 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July I, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTOV 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 

Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker 



Sequence Listing Error f^^ i m a r y 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: O ^ ^/ & O 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



Wrapped Nucleics 



Wrapped Aminos 



The numberAext at the end of each line "wrapped" down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 

The amino acid number/text at the end of each line "wrapped " down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 



3 Incorrect Line Length The rules require that a line not exceed 72 characters in length. This includes spaces. 

A Misaligned Amino Acid The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs 

Numbering between the numbering. It is recommended to delete any tabs and use spacing between the numbers. 



Non-ASCII 



This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 



6 Variable Length Sequence(s) contain n's or Xaa's which represented more than one residue. 

As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) feature section that some may be missing. 

7 Patentln ver. 2.0 "bug" A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequence(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section 
to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223> 
sections for Artificial or Unknown sequences. 



Skipped Sequences 
(OLD RULES) 



Sefluence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(i) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence is intentionally skipped 



Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s). 



Skipped Sequences 
(NEW RULES) 



10 ^^Useofn' 



's or Xaa's 
(NEW RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listi ng. 
Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



1 1 Use of "Artificial" Use of "Artificial" only as "<21 3> Organism" response is incomplete, per 1 .823(b) of New Sequence Rules. 

(NEW RULES) Valid response is Artificial Sequence. 



12 Use of <220>Feature 

(NEW RULES) 



Sequence(s) are missing the <220>Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <2t3>ORGANISM is "Artificial Sequence" or "Unknown" 
Please explain source of genetic material in <220> lo <223> section. 

(See "Federal Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of new Rules) 



13 



Patentln ver. 2.0 "bug" 



Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted 

file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). 

Instead, please use "File Manager or any other means to copy file to floppy disk. 



AMC - Biotechnology Systems Branch - 4/06/2001 



Page 1 of 7 



PCT 



RAW SEQUENCE LISTING DATE: 05/07/2001 

PATENT APPLICATION: US/09/807,949 TIME: 14:23:58 

Input Set : A:\D-021-51.app 

Output Set: N:\CRF3\05072001\I807949.raw 

3 <110> APPLICANT: Zavada, Jan PoVflPty 

4 Pastorekova, Silvia r\nGsN0t u 



5 Pastorek, Jaromir 

7 <120> TITLE OF INVENTION : MN Gene and Protein ww .- - 

9 <130> FILE REFERENCE: D-0021.5 PCT Lf-3 
C--> 11 <140> CURRENT APPLICATION NUMBER: US/09/807,949 pP - 

C--> 12 <141> CURRENT FILING DATE: 2001-04-19 I 

14 <150> PRIOR APPLICATION NUMBER: 09/177,776 

15 <151> PRIOR FILING DATE: 1998-10-23 

17 <150> PRIOR APPLICATION NUMBER: 09/178,115 

18 <151> PRIOR FILING DATE: 1998-10-23 
20 <160> NUMBER OF SEQ ID NOS : 14 3 

22 <170> SOFTWARE: Patentln Ver . 2.1 

24 <210> SEQ ID NO: 1 

25 <211> LENGTH: 1522 

26 <212> TYPE: DNA 

27 <213> ORGANISM: HUMAN 

29 <220> FEATURE: 

30 <221> NAME/KEY: CDS 

31 <222> LOCATION: (13).. (1389) 

33 <220> FEATURE: 

34 <221> NAME/KEY: mat_peptide 

35 <222> LOCATION: ( 124 )..( 1389 ) 

37 <400> SEQUENCE: 1 

38 acagtcagcc gc atg get ccc ctg tgc ccc age ccc tgg etc cct ctg ttg 51 

39 Met Ala Pro Leu Cys Pro Ser Pro Trp Leu Pro Leu Leu 

40 -35 -30 -25 

42 ate ccg gec cct get cca ggc etc act gtg caa ctg ctg ctg tea ctg 99 

4 3 lie Pro Ala Pro Ala Pro Gly Leu Thr Val Gin Leu Leu Leu Ser Leu 

44 -20 . ' -15 -10 

46 ctg ctt ctg atg cct gtc cat ccc cag agg ttg ccc egg atg cag gag 147 

4 7 Leu Leu Leu Met Pro Val His Pro Gin Arg Leu Pro Arg Met Gin Glu 

48 -5 -1 1 5 

50 gat tec ccc ttg gga gga ggc tct tct ggg gaa gat gac cca ctg ggc 195 

51 Asp Ser Pro Leu Gly Gly Gly Ser Ser Gly Glu Asp Asp Pro Leu Gly 

52 10 15 20 

54 gag gag gat ctg ccc agt gaa gag gat tea ccc aga gag gag gat cca 24 3 

55 Glu Glu Asp Leu Pro Ser Glu Glu Asp Ser Pro Arg Glu Glu Asp Pro 

56 25 30 35 40 

58 ccc gga gag gag gat eta cct gga gag gag gat eta cct gga gag gag 291 

59 Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu 

60 45 50 55 

62 gat eta cct gaa gtt aag cct aaa tea gaa gaa gag ggc tec ctg aag 3 39 

63 Asp Leu Pro Glu Val Lys Pro Lys Ser Glu Glu Glu Gly Ser Leu Lys 

64 60 65 70 

66 tta gag gat eta cct act gtt gag get cct gga gat cct caa gaa ccc 387 

67 Leu Glu Asp Leu Pro Thr Val Glu Ala Pro Gly Asp Pro Gin Glu Pro 
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154 tagaggctgg atcttggaga atgtgagaag ccagccagag gcatctgagg gggagccggt 144 9 
156 aactgtcctg tcctgctcat tatgecaett ccttttaact gecaagaaat tttttaaaat 1509 
158 aaatatttat aat 1522 

161 <210> SEQ ID NO: 2 

162 <211> LENGTH: 459 

163 <212> TYPE: PRT 

164 <213> ORGANISM: HUMAN 
166 <400> SEQUENCE: 2 
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206 Gly Leu Glu Met Ala Leu Gly Pro Gly Arg Glu Tyr Arg Ala Leu Gin 

207 175 180 185 

209 Leu His Leu His Trp Gly Ala Ala Gly Arg Pro Gly Ser Glu His Thr 

210 190 195 200 

212 Val Glu Gly His Arg Phe Pro Ala Glu lie His Val Val His Leu Ser 

213 205 210 215 

215 Thr Ala Phe Ala Arg Val Asp Glu Ala Leu Gly Arg Pro Gly Gly Leu 

216 220 225 230 235 

218 Ala Val Leu Ala Ala Phe Leu Glu Glu Gly Pro Glu Glu Asn Ser Ala 

219 240 245 250 

221 Tyr Glu Gin Leu Leu Ser Arg Leu Glu Glu lie Ala Glu Glu Gly Ser 

222 255 260 265 

224 Glu Thr Gin Val Pro Gly Leu Asp lie Ser Ala Leu Leu Pro Ser Asp 

225 270 275 280 

227 Phe Ser Arg Tyr Phe Gin Tyr Glu Gly Ser Leu Thr Thr Pro Pro Cys 

228 285 290 295 

230 Ala Gin Gly Val He Trp Thr Val Phe Asn Gin Thr Val Met Leu Ser 

231 300 305 310 315 
' 233 Ala Lys Gin Leu His Thr Leu Ser Asp Thr Leu Trp Gly Pro Gly Asp 

234 320 325 330 

236 Ser Arg Leu Gin Leu Asn Phe Arg Ala Thr Gin Pro Leu Asn Gly Arg 

237 335 340 345 

239 Val He Glu Ala Ser Phe Pro Ala Gly Val Asp Ser Ser Pro Arg Ala 

240 350 355 360 

242 Ala Glu Pro Val Gin Leu Asn Ser Cys Leu Ala Ala Gly Asp He Leu 

243 365 370 375 

24 5 Ala Leu Val Phe Gly Leu Leu Phe Ala Val Thr Ser Val Ala Phe Leu 
246 380 385 390 395 

24 8 Val Gin Met Arg Arg Gin His Arg Arg Gly Thr Lys Gly Gly Val Ser 
249 400 405 410 

251 Tyr Arg Pro Ala Glu Val Ala Glu Thr Gly Ala 

252 415 420 

256 <210> SEQ ID NO: 3 

257 <211> LENGTH: 29 

258 <212> TYPE: DNA 

259 <213> ORGANISM: HUMAN 

261 <400> SEQUENCE: 3 

262 cgcccagtgg gtcatcttcc ccagaagag 

265 <210> SEQ ID NO: 4 

266 <211> LENGTH: 19 

267 <212> TYPE: DNA 

26 8 <213> ORGANISM: HUMAN 

270 <400> SEQUENCE: 4 

271 ggaatcctcc tgcatccgg 

274 <210> SEQ ID NO: 5 

275 <211> LENGTH: 10898 

276 <212> TYPE: DNA 

277 <213> ORGANISM: HUMAN 
279 <220> FEATURE: 



29 



/a/ a 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/807,949 



DATE 
TIME 



05/07/2001 
14 :23: 59 



Input Set : A:\D-021-51.app 

Output Set: N:\CRF3\05072001\I807949.raw 



L:ll M:270 C: Current Application Number differs, Replaced Application Number 
L:12 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:316 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID#:5 
L:316 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 

L:512 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID#:9 
L:512 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:9 

L:688 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID#:25 
L:688 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:25 

L:702 M:258 W: Mandatory Feature missing, <223> not found for SEQ IDS: 26 
L:702 M:341 W: (46) "n n or "Xaa" used, for SEQ ID#:26 

L:1371 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID#:58 
L:1371 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:58 

L:1804 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID#:90 
L:1804 M:341 W: (46) "n" or "Xaa" used, for SEQ IDft:90 

L:2064 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID# : 110 
L:2064 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 110 
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RAW SEQUENCE LISTING DATE: 05/07/2001 

PATENT APPLICATION: US/09/807,949 TIME: 14:23:58 

Input Set : A:\D-021-51.app 

Output Set: N:\CRF3\05072001\I807949.raw 

280 <221> NAME/KEY: gene 

281 <222> LOCATION: (1). .{10898) 

283 "<400> SEQUENCE: 5 

284 ggatcctgtt gactcgtgac cttaccccca accctgtgct ctctgaaaca tgagctgtgt 60 

285 ccactcaggg ttaaatggat taagggcggt gcaagatgtg ctttgttaaa cagatgcttg 120 

286 aaggcagcat gctcgttaag agtcatcacc aatccctaat ctcaagtaat cagggacaca 180 

287 aacactgcgg aaggccgcag ggtcctctgc ctaggaaaac cagagacctt tgttcacttg 24 0 

288 tttatctgac cttccctcca ctattgtcca tgaccctgcc aaatccccct ctgtgagaaa 300 

289 cacccaagaa ttatcaataa aaaaataaat ttaaaaaaaa aatacaaaaa aaaaaaaaaa 360 

290 aaaaaaaaaa gacttacgaa tagttattga taaatgaata gctattggta aagccaagta 4 20 

291 aatgatcata ttcaaaacca gacggccatc atcacagctc aagtctacct gatttgatct 480 

292 ctttatcatt gtcattcttt ggattcacta gattagtcat catcctcaaa attctccccc 540 

293 aagttctaat tacgttccaa acatttaggg gttacatgaa gcttgaacct actaccttct 600 

294 ttgcttttga gccatgagtt gtaggaatga tgagtttaca ccttacatgc tggggattaa 660 

295 tttaaacttt acctctaagt cagttgggta gcctttggct tatttttgta gctaattttg 720 

296 tagttaatgg atgcactgtg aatcttgcta tgatagtttt cctccacact ttgccactag 7 80 

297 gggtaggtag gtactcagtt ttcagtaatt gcttacctaa gaccctaagc cctatttctc 840 

298 ttgtactggc ctttatctgt aatatgggca tatttaatac aatataattt ttggagtttt 900 

299 tttgtttgtt tgtttgtttg tttttttgag acggagtctt gcatctgtca tgcccaggct 9 60 

300 ggagtagcag tggtgccatc tcggctcact gcaagctcca cctcccgagt tcacgccatt 1020 

301 ttcctgcctc agcctcccga gtagctggga ctacaggcgc ccgccaccat gcccggctaa 1080 

302 ttttttgtat ttttggtaga gacggggttt caccgtgtta gccagaatgg tctcgatctc 1140 

303 ctgacttcgt gatccacccg cctcggcctc ccaaagttct gggattacag gtgtgagcca 1200 

304 ccgcacctgg ccaatttttt gagtctttta aagtaaaaat atgtcttgta agctggtaac 1260 

305 tatggtacat ttccttttat taatgtggtg ctgacggtca tataggttct tttgagtttg 1320 

306 gcatgcatat gctacttttt gcagtccttt cattacattt ttctctcttc atttgaagag 1380 

307 catgttatat cttttagctt cacttggctt aaaaggttct ctcattagcc taacacagtg 14 40 

308 tcattgttgg taccacttgg atcataagtg gaaaaacagt caagaaattg cacagtaata 1500 

309 cttgtttgta agagggatga ttcaggtgaa tctgacacta agaaactccc ctacctgagg 1560 

310 tctgagattc ctctgacatt gctgtatata ggcttttcct ttgacagcct gtgactgcgg 1620 

311 actatttttc ttaagcaaga tatgctaaag ttttgtgagc ctttttccag agagaggtct 1680 

312 catatctgca tcaagtgaga acatataatg tctgcatgtt tccatatttc aggaatgttt 1740 

313 gcttgtgttt tatgctttta tatagacagg gaaacttgtt cctcagtgac ccaaaagagg 1800 

314 tgggaattgt tattggatat catcattggc ccacgctttc tgaccttgga aacaattaag 1860 y 

315 ggttcataat ctcaattctg tcagaattgg tacaagaaat agctgctatg tttcttgaca 1920 7^ 
W--> 316 ttccacttgg taggaaataa gaatgtgaaa ctcttcagtt ggtgtgtgtc cc^Qfttttt 1980 ^ " a 

317 ttgcaatttc cttcttactg tgttaaaaaa aagtatgatc ttgctctgag aggtgaggca 2040 , / Q **? U 

318 ttcttaatca tgatctttaa agatcaataa tataatcctt tcaaggatta tgtctttatt 2100 poS tf f ' oin 

319 ataataaaga taatttgtct ttaacagaat caataatata atcccttaaa ggattatatc 2160 / 

320 tttgctgggc gcagtggctc acacctgtaa tcccagcact ttgggtggcc aaggtggaag 2220 

321 gatcaaattt gcctacttct atattatctt ctaaagcaga attcatctct cttccctcaa 2280 _f<?<? / 0 ^ r 



322 tatgatgata ttgacagggt ttgccctcac tcactagatt gtgagctcct gctcagggca 234 0 

323 ggtagcgttt tttgtttttg tttttgtttt tcttttttga gacagggtct tgctctgtca 2400 /°^' 

324 cccaggccag agtgcaatgg tacagtctca gctcactgca gcctcaaccg cctcggctca 24 60 

325 aaccatcatc ccatttcagc ctcctgagta gctgggacta caggcacatg ccattacacc 2520 

326 tggctaattt ttttgtattt ctagtagaga cagggtttgg ccatgttgcc cgggctggtc 2580 

327 tcgaactcct ggactcaagc aatccaccca cctcagcctc ccaaaatgag ggaccgtgtc 264 0 

328 ttattcattt ccatgtccct agtccatagc ccagtgctgg acctatggta gtactaaata 2700 

329 aatatttgtt gaatgcaata gtaaatagca tttcagggag caagaactag attaacaaag 2760 



c 



Please Note: 
Use of n and/ 

Sequence Lis.....-, . 

7^ <223> fields of each sequence which presents at least one n or Xaa. 



1 Please Note: . 4 . 

J V < Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
^.1 ^ N Sequence Listing to ensure that a corresponding explanation ^presented in the <220> to 
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